Corpus: hin_wikipedia_2014_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 3156 स-
2 2633 प-
3 2296 क-
4 1922 म-
5 1647 ब-
Top Character Bigrams
word rank frequency n-gram
1 861 प्-
2 627 स्-
3 610 वि-
4 403 सं-
5 381 का-
Top Character Trigrams
word rank frequency n-gram
1 801 प्र-
2 143 स्ट-
3 142 स्व-
4 140 कार-
5 134 ब्र-
Top Character 4-Grams
word rank frequency n-gram
1 153 प्रत-
2 115 प्रा-
3 104 कार्-
4 72 भारत-
5 70 पूर्-
Top Character 5-Grams
word rank frequency n-gram
1 113 प्रति-
2 55 पूर्व-
3 48 भारती-
4 43 विश्व-
5 43 उत्तर-
489 msec needed at 2021-08-21 17:06